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The conversation starts here. 


Thinking about Data 


BY CLAY SHIRKY 


We could imagine no one better suited to write an issue on informa- 
tion visualization than Clay Shirky. The way he thinks about things 
and then describes them so that others can see them - with or without 
charts - is a model of the art. We first heard him brainstorm about 
peer-to-peer software at the Aspen Institute in the summer of 2000. 
More recently, he limned the difference between a community and an 
audience (broadly, many to many vs. many from one, but it was the 
drawing that made it all clear). 


As Clay notes in the clear-sighted roundup below, the tools and the 
user environment to take advantage of information visualization are 
both improving. The challenge is for people to think hard and elegant- 
ly about information design... It’s something Clay himself does strik- 
ingly well, but we still need him in his day job, as adjunct professor at 
NYU's ITP program and consultant to businesses grappling with 
issues of technological decentralization. 


- Esther Dyson 


We are such visual creatures that we often discuss thought in visual 
terms: We have points of view; we change our perspective; we say 
“See what I’m saying?” and “Ah, that clears it up.” This experience is 
active as well; we use our ability to see as a way to help us think and 
create structure. From papyrus to the whiteboard, we have always 
used visual tools as aids to cognition. 


Computers have been able to produce graphical representations 


since the first ASCII art, but most computer displays are still text- 
heavy, with options limited to simple icons and size or variety of 
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fonts. Our lives abound with mundane problems of display (“Which 
PowerPoint template should I use?”), and the designers of our appli- 
cations spend an enormous amount of time designing graphical 
user interfaces. These visual aids — everything from desktop icons to 
bar and pie charts — are relatively straightforward, and have become 
part of daily life. 


Information visualization differs from these forms of graphic com- 
munication in several important ways. Whereas most of the visual 
tools we use are designed for presentation of known objects or 
quantities — a file icon or a chart of sales figures — information visu- 
alization is more open-ended. It is a way of using graphical tools to 
see things previously unseen: structures, relationships or data 
obscured by other data. 


When you set out to create a chart, you know the nature of the fig- 
ures to be represented in advance; the chart is a graphic recreation of 
those existing numbers. When you are staring at a pad of paper or 
standing at a whiteboard, however, the visualization is part of the 
process. Information visualization tools are visual by definition and 
interactive in implementation; they work more like the whiteboard 
than the bar chart. A user of a visualization tool is not simply pre- 
sented with static data; instead, the user can explore a set of data 
interactively, by changing the data modeling or the hypotheses being 
tested and seeing the resulting visual presentation. This feedback 
loop is part of the exploration; the tools support the process rather 
than simply expressing the results. 


Information visualization vs. graphical presentation 

In order to allow a user to have a conversation with the data, infor- 
mation visualization tools do things that even the most sophisticat- 
ed charting and graphing packages can't. First of all, the techniques 
used make it easier to handle multi-variate and dense sets of data in 
a comprehensible way, thus helping to make invisible correlations 
visible. Instead of the standard menu of bar chart/line chart/pie 
chart, these tools offer presentation methods customized for partic- 
ular domains and densities of information. The degree of cus- 
tomization of the tool also gives the user a greater degree of freedom 
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to change the axes ordering the data, or to highlight or suppress certain kinds of data, 
in order to explore the underlying relationships. 


A number of different data comparisons could be done in a multi-column spread- 
sheet, of course, but placing these myriad comparisons into one visual interface pro- 
duces something no spreadsheet can: the gestalt of the data. In the same way that a 
chord is more than its individual notes and a family is more than its individual mem- 
bers, the overall pattern of data often exhibits patterns that emerge from the individ- 
ual pieces of data, patterns that can be impossible to discern by merely re-sorting the 
data in spreadsheet format. 


Second, information visualization tools treat the graphical aspects of the display as 
the critical element of the interface. Even the most heavily graphical Web interface or 
operating system GUL is essentially decorated text, with the graphical elements usual- 
ly acting as little more than icons with text labels —“Home’, “Products” and “Contact 
Us” or “Presentation.ppt” and “ToDo.doc”. (Imagine trying to use a purely graphical- 
ly expressed desktop, with no text at all; other than dragging unlabeled files to the 
trash, yowd find it impossible to get anything done.) Information visualization tech- 
niques, by contrast, make the graphical presentation of data the most important ele- 
ment, typically using text sparingly, as labels within visually important fields. 


Information visualization techniques offer a more varied palette of visual character- 
istics than the typical interface: Characteristics such as color, brightness, contrast, 
surface area, line weight, and visual pattern, among many others, can be used to rep- 
resent differences between pieces of data. Indeed, one of the great challenges of infor- 
mation visualization is knowing when to say no to this embarrassment of riches — the 
hardest choices are often among the few visual aspects that will best represent the 
underlying data set in a way that’s easy to grasp. Does the thickness of the line repre- 
sent the age of the customer or cumulative revenues? Does brightness indicate activi- 
ty or proximity? 


Third, and most important, information visualization provides interfaces for asking 
questions of the data. The key element separating it from mere graphic presentation 
is interactivity. In their introductory chapter to Readings in Information Visuali- 
zation (SEE RESOURCES, PAGE 33), Ben Shneiderman of the Human-Computer Inter- 
action Laboratory at the University of Maryland and Stuart Card of Xerox PARC 
write that interactivity is an essential aspect of information visualization, defining it 
as the “use of interactive visual representations of abstract, nonphysically-based data 
to amplify cognition.” Whereas charts and graphs present known data in a frozen 
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scene, information visualization is a way of letting the user query the data in real 
time — not by changing the data (which presumably was collected some time ago), 
but by digging into the data and changing its representation. 


A good information interface lets users manipulate what they see in order to help 
them ask better questions about the data shown, and new answers beget new ques- 
tions. The canonical first reaction to a graphic presentation is “Oh, now I see what I 
knew.” The canonical first reaction to information visualization, however, is closer to 
“Hmm, that’s funny. I see something I didn’t know before. What’s going on there?” 


Visual revolutions 

The opportunity here is enormous. Two previous revolutions have relied on visual 
presentation — the spread of PCs, which began reaching a much wider audience with 
the invention of the GUI, and the spread of the Web, after Mosaic turned it into a tool 
for “the rest of us.” The need is enormous as well. When we shift from our desktop to 
the Web, we go from a small and highly controlled environment to a vast, chaotic 
one, and yet the visual tools for handling the complexities of the Web are much less 
well-developed than the desktop. Even the desktop is a glorified filing system, all but 
useless for extracting patterns from large amounts of data. Likewise, our ability to 
handle the large and growing quantities of data stored in databases is very limited, as 
most of the tools we have for searching or extracting information are visually thin. 


Improving the ability of the average employee to handle larger amounts of data on 
their own has been immensely valuable in the past, as with the spread of tools like 
VisiCalc or Microsoft Access. 


Information visualization has the potential to accomplish both things: to provide a 
new visual language for letting the average user get their work done, while at the 
same time increasing the amount of work they can do. In the past, the spread of 
information visualization has been hampered by the GIGO problem: Garbage In, 
Garbage Out. Tools that help users visualize the structure of data are useless if the 
data is unstructured. There’s also the less well-known NINO problem: Nothing In, 
Nothing Out. Organizations that are not aggressive about collecting, indexing, and 
storing data can’t visualize it. 


Both of these difficulties are abating, thanks to the widespread use of databases as 


standard storage tools. Even in companies that have never considered using infor- 
mation visualization tools, more of their data is stored in some way that imposes 
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some structure. Furthermore, as companies design solutions for indexing the users’ 
email boxes and randomly named files (sEE RELEASE 1.0, JUNE 2001), an increasing 
amount of structure is being created at the level of the individual PC. Finally, the 
spread of XML (whose co-inventor, Tim Bray, is a founder of Antarctica Systems, 
profiled below) provides a format that allows a wide variety of tools to begin to 
interoperate with one another. (Semio’s Tagger can provide output to Antarctica’s 
Visual Net, Inxight OEMs its solution to other software vendors, and so on.) 


The users need something to help them handle more information. Businesses have 
always profited from tools that do that. The tools to generate the kind of structured 
data information visualization software needs are now widespread. All that’s needed 
now is someone to figure out how to put the pieces together. 


No magic bullet 

This is not to say that information visualization tools don’t face problems that have 
kept them from mainstream use. The most obvious one is the screen. The surface 
area of the largest commodity PC screen is only a fraction of the size and visual den- 
sity of the front page of a newspaper, and unlike chips and disks, screen size and res- 
olution are not improving very quickly. Some industries, such as finance, get around 
this problem by equipping their employees with two or more large screens, but that’s 
beyond the financial reach of most businesses; meanwhile, work such as that in 
IBM’s Roentgen Project, which seeks to double screen resolution, is still in the lab. 
IDC estimates that by the end of 2003, 75 percent of the installed base of monitors 
will still be 17-inch or less. Anyone wanting to offer information visualization prod- 
ucts or services to a large audience has to target display technology that is going to 
see only incremental short-term improvement. 


The second problem is human work patterns. As mentioned above, information 
visualization tools can handle multi-variate data sets by mapping different axes of 
data to things like line weight, surface pattern, and motion. However, the resulting 
interfaces can often leave the user spending more time trying to understand the 
interface than the data itself. As Esther Dyson herself has pointed out, “When we say 
‘intuitive’ we often mean ‘familiar’ Information visualization tools create symbolic 
overhead: a user has to learn new interfaces, software behaviors or visual symbols, 
and to the degree that these interfaces are unfamiliar, they feel unintuitive. (Firms 
offering information visualization tools typically work to convert the esoteric inter- 


faces developed in R&D environments to something the user can easily understand.) 
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There is also the question of skill in visual presentation by the end users. Computers 
make it easier to make bad visual presentations than to make good ones: After the 
arrival of desktop publishing, the simplest birthday party invitation usually arrived 
in five different fonts. Graphics tools like Photoshop and presentation tools like 
PowerPoint have had a similar effect, vastly increasing the number of people capable 
of creating graphics without doing anything to increase the number of people capa- 
ble of creating useful graphics. Given these restrictions, information visualization is 
usually sold as both a product and a service, with per-client customization and some 
degree of user training as a core part of any contract. 


Chartjunk goes digital 

The result today is the computerized equivalent of what Edward Tufte, author of a 
trio of books endorsing a lean style of information presentation, calls “chartjunk,” 
the elements in a visual presentation that don’t convey any information. They are 
gratuitous symbols or decorations, background images, even animations, all added 
to create some “effect” other than helping the user understand the underlying infor- 
mation. (Look at almost any issue of USA Today for an example.) In his books, he 
argues that the essence of good graphical presentation is the encoding of multiple 
variables into a single display of information in a way that rewards the user for 
spending time looking, rather than confusing or distracting him. 


Tufte cites a number of examples that show the power of information density. The 
most famous is a 19th-century timeline of Napoleon’s 1812 advance into Russia and 
his subsequent retreat, superimposed on a simple map of the region. The map uses a 
single band to trace Napoleon’s path, with the thickness of the band representing the 
number of soldiers. Along the bottom of the map, the temperature is shown by date. 
The band starts thick, showing nearly half a million soldiers setting out at the begin- 
ning of the campaign, but thins as Napoleon marches towards Moscow, and thins 
further on his return. The most dramatic aspect of the map is seeing the way the 
band abruptly contracts as the army fords the Berezina River, representing thou- 
sands of deaths in a single day. At the end of the retreat, Napoleon has only 10,000 
soldiers left, shown in a band a fraction of the width of the original. There are six 
dimensions of data on this two-dimensional chart — the size of the army, latitude 
and longitude of the map, time, direction of the march, and temperature. 


Tufte also offers several egregious examples of chartjunk, as with a graphic that 
traces the sharp rise and fall of the price of diamonds along the fishnet-clad leg of a 
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cartoon, Marilyn-esque chorus girl. Needless to say, the numerical data concerning 
the price of diamonds, the ostensible purpose of the graphic, is obscured. 


Tufte’s best-known work focuses on printed material, but his work is exceptionally 
relevant to digital displays of information because of their restricted display space. 

In this environment, increasing density and clarity without redundancy is a critical 
need. As he puts it, “Good design is clear thinking made visible.” 


Information Visualization in the Real World 


Information visualization has existed as a separate area of inquiry since the 1987 
National Science Foundation report, “Visualization in Scientific Computing”. In the 
15 years since, a variety of new techniques have been invented in various research 
labs, such as the HCIL at the University of Maryland and Xerox PARC. However, the 
commercial successes in information visualization have been few and far between. 


That may be changing, as there are now several companies selling practical software 
to skeptical businesses. Many of these businesses started by offering general-purpose 
tools, but have since adopted a strategy of focusing on particular market verticals. 
Financial services and life sciences firms are the two most common verticals, because 
they are so data intensive. And the current generation of tools is starting to create 
some impressive gains. Andarko, an oil exploration firm, uses Spotfire’s “Guided 
Analytics” tools (Pace 21) to help analyze thousands of possible economic scenarios 
simultaneously, with results that are both better and faster than those from analyz- 
ing the same data in tabular format. (In one instance, the ability to visualize the data 
alerted them to fatal weaknesses in a drilling opportunity that would have cost them 
$9 million to pursue.) 


Though the approaches to information visualization discussed below differ consid- 
erably, they have in common the idea that to get information visualization out into 
the world, the tools must be simple enough to use without significant retraining, 
and be tolerant of partial or incomplete data sets. 


Anatomy of a visualization tool: Smart Money's Market Map 
Frank Zappa once said that writing about music was like dancing about architecture. 
He could have said the same for writing about visual tools. To see the kinds of prob- 
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lems visualization can solve, it helps to look at a visualization tool, so fire up your 
browser as you read this and visit the URLs listed throughout. 


The Market Map (SEE FIGURE 1 IN THE COLOR INSERT; WWW.SMARTMONEY.COM/MARKETMAP/), 
developed by Martin Wattenberg for Smart Money magazine’s Website, is deceptively 
simple. It shows roughly 600 publicly traded companies, chosen by Smart Money’s 
editors for their importance and grouped by sector, in a format called a “heat map.” 
This presentation is meant to allow for quick comparisons among many data points 
based on size and “heat,” in this case defined as the increase or decrease in stock 
price. (The heat map concept is itself derived from work on tree maps, which came 
out of Ben Shneiderman’s visualization lab at the University of Maryland (see sipe- 
BAR PAGE 24).) 


On the Market Map, each company is represented by a colored rectangle. The size of 
each rectangle is proportional to the company’s market capitalization, and the color, 
which ranges from red through brown to green, represents the percentage change in 
stock price. Mousing over each rectangle will give you the company’s name and per- 
formance numbers. Clicking on the rectangle lets you drill down into a variety of 
additional information. 


And that’s it: just two visual variables, surface area and color. And yet, with one 
interface, the map shows you an enormous range of comparisons: the relative mar- 
ket cap of all companies in a particular sector and across the map as a whole, as well 
as the relative market caps of the various sectors compared with one another and 
with the whole market. It also shows you the session-on-session performance of 
each company as well as the comparative performance of individual companies to 
their sector group, to other sector groups, and to the market as a whole, the relative 
performance of individual sector groups to one another and to the market as a 
whole, and the overall behavior of the market. And of course, you can rescale the 
variables, e.g. switching from daily performance to weekly, and scaling the percent- 
age changes up or down to adjust for market volatility. 


The Market Map lets you feel the market in a way that disaggregated data can’t. The 
gestalt of the map lets you see, in a moment, whether the index in question is wildly 
up everywhere (two years ago!), broadly but weakly down with some sharp decliners 
(typical of 2002), or uniformly bad with one or two standouts (financial institutions 
in the middle of 2002). The overall market pattern is an emergent property of the 
behavior of its individual elements, and there are many ways of displaying either the 
behavior of the elements or the behavior of the market as a whole. Tools like the 
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Market Map, however, not only allow you to see both patterns, but to switch back 
and forth between micro and macro, and between asking questions and drilling 
down for answers, all through the same interface. 


If the Market Map tool is as good as it is, and easy to distribute over the Web, why 
isn’t information visualization everywhere? The same tool could be applied to retail 
sales in different stores/regions, student performance on tests by school, load factors 
on airline routes, employee productivity by factory or product line or shift, etc. 


The answer is that while the Market Map creates a powerful overall sense of the data 
it represents, it is not a universally extensible solution. It requires data in a certain 
format (numerical comparisons among equals), it requires careful matching of 
underlying variables to visual representation (color for performance and area for 
market cap), and most of all it requires that the users understand how to read it. This 
is the way information visualization is moving into the world — one sector at a time, 
with lots of customization and user training along the way. 


Panopticon: Extending financial visualization 

One success factor for information visualization tools is a willingness to specialize. 
Panopticon, a Stockholm-based firm, has taken this approach by extending the heat 
map to a number of new markets, as well as extending it to other sorts of numerical 
spaces, such as internal sales figures. The company, named for Jeremy Bentham’s 
hypothetical prison where one guard could observe all the inmates, seeks to let users 
extract the essence from large volumes of data. 


Panopticon’s focus on the financial market reflects its heritage. Originally built as a 
data tool for emerging-markets broker Brunswick Direct (since closed), Panopticon 
was spun off as a separate venture in 2001, with a mission to customize heat map- 
based tools for external clients to offer to their users. CEO Willem De Geer talks in 
terms of knowledge advantage. The Panopticon site even quotes the French philoso- 
pher Michel Foucault, who wrote about Bentham’s Panopticon: “. . the power to 
dominate rests on the differential possession of knowledge.” 


While the original Market Map presented only a few of the vast number of data sets 
that financial professionals deal with daily, the Panopticon version has extended this 
idea to other large financial data sets. Panopticon has created several tools, each cus- 
tomized for a particular kind of client and a particular kind of financial data — for 
example, its options pricing map for the Bourse de Montreal. 
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For Credit Suisse First Boston (SEE FIGURE 2 IN THE COLOR INSERT; WWW.PANOPTICON.COM), 
Panopticon took the previously abstract market map with its two simple variables, 
color and surface area, and added a third — a real, underlying map. The CSFB inter- 
face shows the overall performance of several dozen of the world’s stock markets, 
superimposed on a map of the world. In the same way the original Market Map 
allowed comparisons between individual companies, sectors and the whole market, 
the CSFB global view allows comparisons between individual markets, regions such 
as Northern Europe or South America, and the whole world. In addition, by being 
laid out on a world map, it also arranges the markets by time zone, so that the flow of 
market openings and closings across the world is implicitly tracked. This is an exam- 
ple of the kind of density that Edward Tufte lauds, adding more data to an existing 
interface to make it richer, while not muddying the clarity for the user. 


Panopticon’s interfaces are optimized for data that can be directly compared in 

numerical terms, and where you want to know the patterns of both individual ele- 
ments and aggregates. Market data fits this profile perfectly because 
comparisons like price movement and volume can be expressed 
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numerically, and they mean the same thing whether you are talking 


Headquarters: Stockholm, Sweden 


Founded: June 2000 
Employees: 14 


about individual companies, sectors, or whole markets. 


In addition to market data analysis products, Panopticon has creat- 


Key metric: 30 clients in 10 countries ed a Management Center product to let managers use heat-map 


and in 6 sectors 


Funding: spun off from Brunswick Group 


URL: www.panopticon.com 


visualizations for internal company data, such as revenues per 
employee, department, and business. The intuition behind the 
Management Center is simple: Most managers are dealing with a 
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portfolio of things with variable performance, such as widgets per 
hour, or profit and loss over several business units, and much of the 
valuable data is in-house but too widely dispersed to be useful. The Management 
Center provides a heat-map interface in which managers can quickly view a large 
number of inputs, in order to derive both general trends and to identify unusually 
weak or strong performers. 


Panopticon also provides a set of interfaces for companies to produce their own heat 
maps from a variety of in-house data sources, ranging from the simplest Excel 
spreadsheet, to complex OLAP and ODBC-accessible data. It will also begin licens- 
ing the Map API to companies who want to integrate Panopticon’s products into 
other tools this month. 
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De Geer characterizes their strategy as moving from one market segment to the next, 
customizing tools along the way. Although Smart Money opened many eyes to the 
possibilities of information visualization, that alone is not enough to build a viable 
businesses. “The first sale [in a particular market] we always have to do ourselves,” 
says De Geer. “The question is how we leverage that sale. If there is not a clear 
income stream, not a reference client to point to who is making money, people wont 
buy it. There are so many cool technologies out there, but cool is not enough.” De 
Geer is also seeing more business in Europe. “When we started, most of our revenue 
was from the States, but now it’s about 50/50 [with Europe] as the concept spreads.” 


Panopticon is currently concentrating on three verticals: financials, corporate man- 
agement, and life sciences. Pricing is based on the number of users, the value of the 
data to the client’s core business, and the amount it costs them to process that data 
without the tool. Because of the need to customize the interface for particular data 
feeds and to educate users about how to work with the Panopticon interface, the 
company’s strategy is to “grow with the people who know the business,” says De 
Geer. Panopticon is working with consultancies and IT vendors who know the needs 
of banks or life sciences firms, and who can explain the value of information visual- 
ization in terms of the potential client’s needs. 


Webs without spiders: Plumb Design’s Thinkmap 

As outlined above, one key function of information visualization is to reveal hidden 
relationships. This is relatively easy with structured data: The data in a database has 
usually been selected to fit into defined fields in a relational database, and analysis 
tools can structure that data. But what if the data youd like to explore can’t be com- 
pared in a numerical fashion? What if the data being examined is highly intercon- 
nected in ways that defy numerical description? 


This is the question New York City-based Plumb Design addresses with its 
Thinkmap product. Thinkmap is an interface designed around principles of self- 
organization, on the assumption that the internal relationships within the data itself 
should be what the interface displays. The original public version of the Thinkmap 
interface was a thesaurus (SEE FIGURE 3; WWW.VISUALTHESAURUS.CcoM) launched back in 
1997 and still live on Plumbdesign.com. The thesaurus uses the “weighted spring” 
model for connecting data, which represents connections as springs whose inter- 
locking pulls cause the interface to stretch and snap into shape, and helps heavily 
connected clusters draw together automatically. 
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The thesaurus shows given words as points, and draws the connections between 
related words as lines. Clicking on a particular word brings it to the center of the 
interface, and all the other first-degree connections are dynamically re-arrayed 
around the word at the center. Thus, for any choice of a center, the user can see con- 
nected words, the ways in which those words are connected to one another, and, 
depending on how the interface is configured, the next ring of words connected to 
words in the central cluster. This is 


Figure 3: Thinkmap’s original Visual Thesaurus. The chosen word, “occur- 
rence,” is connected to its possible synonyms, with stronger ties in the fore- 
ground, indicated by text color. You can also see that the word “experience’ 
has a large cluster of connected concepts. 
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designed to create an interactive picture 
of a whole space of interrelated concepts, 
and shows clusters implicit in the whole. 
(We’ve previously covered theBrain, 
which uses a similar interface to help a 
user manage and link ideas and informa- 
tion (SEE RELEASE 1.0, DECEMBER 1997). 


mental object 


If this were merely a thesaurus, it might 
N be a nice tool for writers and nothing 
more, but the thesaurus is just a demo; 
the idea behind Thinkmap is much more 
general. Marc Tinkler, the designer of the 


Thinkmap, was simultaneously enrolled 


change 


in the architecture and computer science 
i programs at Carnegie Mellon when he got 
interested in the tension between top- 
down and bottom-up systems of organi- 
zation. His first effort used Web server logs to uncover the pattern of user behavior 
on various Websites. When he observed what users actually did when they visited a 
Website, he discovered that the patterns were very different from what the sites 
designers had expected. The users were finding patterns in the site that the designers 
had not realized existed. Turning this realization inside out, he began working on a 
Website interface that would change in response to the user’s behavior. This work 


eventually became the Plumb Design thesaurus. 


Initial sales of the Thinkmap were to Websites looking for the “cool factor,” from the 
Smithsonian Institution’s digital collection to Sony Music’s licensing department. In 
these more sober times, the Thinkmap has come full circle, says Tinkler, and is now 
mostly used for analytics — as a tool for visualizing interconnected information and 
uncovering valuable hidden structures in business data. The links have to be implicit 
in or added to the data itself, but the Thinkmap creates the visual arrangement that 
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brings out higher-order patterns. (Other companies covered in this issue can do the 
extraction of data as well. Inxight’s Star Tree (ste pace 22) complements its docu- 
ment management tools, Lumeta creates maps of the data it gathers about the shape 
of networks, and Semio, recently acquired by Webversa, created its Semio Map prod- 
uct to help users interpret the output of its document tagging technology.) 


The Thinkmap is particularly valuable, says Tinkler, in fields that rely on a large vol- 
ume of scientific data to drive business decisions, because data about chemical or 
biological compounds is often linked in ways not amenable to simple numerical rep- 
resentation. One of Plumb Design’s clients, a Fortune 500 chemical company, uses it 
in its R&D efforts to help find links between different compounds. Though the 
Thinkmap can show clusters and connections among any kind of linked data, from 
supply chains to boards of directors, Plumb Design is targeting industries with a 
voracious appetite for data management. Like many of the firms profiled here, they 
are focusing on financial services and life sciences, especially pharmaceuticals. 


Tinkler says the essence of the Thinkmap can be expressed in two themes. The first is 
to make the process of querying more like a conversation, by making the interface 
useful both statically and dynamically. At any given point, the inter- 


face should show something useful about the relationships between BLUME DESIEN INES 


the selected elements, while at the same time allowing the user to 
Headquarters: New York, NY 


: : eai 
alter quickly the information onscreen to see a new or adjusted set ere 


of relationships. Employees: 25 
Key metric: $3.5 million in revenue in 
The second theme is to lower the penalty for incremental queries. A 2001 
static database query that returns too large or too small or just too Funding: Undisclosed amount from 
uninteresting a data set typically has to be re-run from scratch. The a posers 
ystems 


Thinkmap, by contrast, allows users to triangulate their way to valu- 
URL: www.plumbdesign.com 


able data, over- and undershooting as they go, until they arrive at a 
data set they are interested in. While the current version treats both 
the data and the links in a relatively uniform way, Tinkler is working on a future ver- 
sion of the interface to include more dynamic properties, like weight or tension in 
the points and connections, so that the overall interface can include perceptual cues 
like “sag” and “stretch.” 


Plumb Design currently offers the Thinkmap either as a software product with a 
yearly license, or as part of a larger design effort by Plumb Design. Of the 20+ com- 
panies using some version of the Thinkmap, roughly half chose the licensing option, 
says ceo Michael Freedman, though even in these cases it usually gets involved in 
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some customization. “The likelihood of the client needing some customization is 
built into the license,” says Freedman, “and we often end up working with the cus- 
tomer at some point to help them get the most out of the tool.” 


Tinkler is quick to disavow grandiose claims about the Thinkmap replacing other 
tools for handling data. He understands that the Thinkmap will need to rely on 
external tools, such as Semio’s Tagger, for the data, and will exist alongside a plethora 
of other tools for handling data, from simple search engines to complex ERP pack- 
ages. Speaking about Thinkmap’s place in the enterprise, he likens it to a rear-view 
mirror: “It’s not the whole windshield, but you wouldn’t want to drive without it.” 


The Perennial Problem of Search 


Seeing implicit patterns in large data sets is useful, but sometimes the problem is not 
to find the large pattern made up of the small pieces, but to find one particular piece 
of data in a vast and growing field. This is a search problem, not a pattern-recogni- 
tion problem. As good as our search engines are, the search problem is still a vexing 
one. There are three intertwined problems in today’s typical search interface — lack 
of context, poor user skills at refining searches, and the tyranny of the list. 


The lack of context is perhaps the best-known problem in current search interfaces. 
If you search for “Jaguar” on Google, the top ten links include seven relating to the 
car, two relating to a software package for chemical analysis, and, at the number one 
spot, Apple’s latest operating system release. (None of the top ten links point to any- 
thing about actual jaguars.) No matter which of those things you were looking for, 
it’s unlikely yowd be interested in the other choices presented. 


Fixing this requires users to become adept at refining their searches, but most users 
still use single-word searches. The power hidden on the “Advanced Search” pages of 
the world’s search engines remains largely unused. And when the results are 
returned, they are presented as an ordered list, as if a search for several different top- 
ics could be linearly ranked in any obvious way. (Even Google’s system forces com- 
plex queries into a rank order that often fails to map to the user’s interests.) The 
search problem is a browse problem as well. Often the user wants to work her way 
down a hierarchy of concepts, from general to specific, as a way of zeroing in on what 
she needs. Information visualization techniques offer novel solutions to searching 
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FOUR INTERFACES 


Despite the variety of information visualization interfaces 
that have been invented in the nation’s labs, the majority 
of actual products use one of only four basic types of 
interface. Ramana Rao of Inxight (SEE PAGE 22) groups 
these interfaces into two categories: surface maps and 
“wide widgets.” 

A surface map is any interface that fills the 
screen and uses surface area and position of data to con- 
vey important relationships . Two basic types of surface 
maps are being offered in commercial products: heat maps 
and cartographic maps. Heat maps allow for quick com- 
parisons among many data points based on size and 
“heat,” usually expressed using color. Tools from 
Panopticon and San Francisco-based Hive Group's 
HoneyComb use this technique. Cartographic maps, such 
as the Antarctica interface, apply to abstract (or at least 
non-geographical) data sets the techniques cartographers 
have developed for packing enormous amounts of data 
into maps of real places. 

“Wide widgets,” by contrast, extend the idea of 
widgets in a GUI - buttons, toolbars, menus and the other 
basic building blocks of a graphical user interface - into 
visualization tools. The width of a wide-widget is concep- 
tual, not visual; a wider widget is more expressive. Within 
the wide-widget category, there are two principal types of 
interfaces: weighted spring and star trees. 

The weighted spring technique is the most com- 
mon, because there are so many fields where the pieces of 
data are of equal importance to the links between them. In 
this technique, the points of data are weighted balls that 
are linked by “springs” with a certain amount of virtual 
pull. The effect is that the more interconnected data will 
draw together into clusters, because the cumulative force 
of many interconnected links will be greater than that of 
more sparsely connected data. Plumb Design's Thinkmap 
(page 11), Lumeta’s network maps (page 25), and inter- 
faces such as those offered by Kartoo (page 18), Semio 
(page 23) and Valdis Krebs’ Inflow, all use variations on the 
weighted spring technique to organize data. 

Star trees are a version of hyperbolic tree maps, 
developed at Xerox PARC and commercialized by Inxight, 
that present vast amounts of hierarchical data in a “fish- 
eye” interface; the data at the center of the interface is 
magnified and the data around the edges is condensed. 

These four types of information visualization 
interfaces are not the only ones possible, of course. Vornoi 


area-filling maps dynamically draw optimal boundaries 
between a set of points; the MIT Social Media Group's chat 
circles use a line-and-bar interface rather than the line- 
and-point of weighted springs. 

Another notable technique not currently used ina 
commercial product is social network analysis, or SNA (SEE 
PAGE 27), which attempts to understand human interaction 
within both formal and informal organizations. Because 
SNA treats visualization as a means to an end, the work 
going on in places like Microsoft's Social Computing Group, 
IBM's Social Computing Group, and MIT's Social Media 
Group are highly tuned to specific problems, and are there- 
fore likelier candidates for commercialization in the short- 
term than the more esoteric products of research labs. 

Given the amount of academic research on infor- 
mation visualization interfaces, including the long-stand- 
ing infatuation with 3D interfaces, it is significant that the 
bulk of commercial products concentrate on so few of the 
possible tools. This suggests either a disconnect or along 
time lag between research and the creation of practical 
solutions for business use. That said, the growing realiza- 
tion that workers typically organize their work in a people- 
centric fashion (“Where's the meeting? The address was in 
a mail from the boss...”) indicates market demand for these 
kinds of tools. 

One new interface appearing this fall comes from 
Microsoft. George Robertson, a senior researcher at 
Microsoft Research and a colleague of Stuart Card's at 
Xerox PARC, has been working with his staff on a tech- 
nique called Polyarchy Visualization. This method of rep- 
resenting hierarchies builds on his work at PARC. 

If you have one hierarchy showing a company org 
chart, and another showing employees by location, then 
the intersection - “Show me the reporting structure of the 
Houston accounting staff" - can't easily be represented. 
Polyarchic visualization tries to solve this with a best-of- 
both-worlds approach. It allows user to see the connec- 
tions between two or more hierarchies in various ways, 
including as a web of links that cross the hierarchies. (Read 
the paper describing Polyarchy Visualization at 
research.microsoft.com/users/marycz/chiZ002poly.pdf.) 

In an important test of general user adoption of 
information visualization tools, the Polyarchy Visualiza- 
tion project is moving out of Microsoft's research depart- 
ment for inclusion in their Metadirectory Services, to 
appear some time early next year. 
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and browsing by presenting the user with richer ways of looking at volumes of data, 
and by giving them better tools for building complex queries. 


The way of the map: Antarctica 

One way to make rich interfaces intelligible is to use familiar visualizations, and no 
visualization is more familiar to the average user than a map. Cartographic maps 
display an extremely complex data set, mixing real-world borders (The US coastline 
looks like this), political borders (The US is north of Mexico), and other data (Green 
areas indicate high rainfall). Antarctica cto Tim Bray’s intuition was that you could 
take the sophisticated techniques cartographers learned over centuries about pre- 
senting data and apply them to maps of information spaces, creating visual inter- 
faces that were simultaneously familiar enough to feel intuitive, but flexible enough 
to represent abstract relationships. 


Bray is perhaps best known for his work on the original XML standard. He sees a 

clear connection between XML and Antarctica: “I’ve spent my life working on better 

access to information, and since 1999 I’ve been trying to build a better GUI for 
information that doesn’t happen to be on your desktop.” As a start- 


ANTARCTICA SYSSTEMS INFO 


ing point, Bray believed that the desktop metaphor — “the world’s 


Headquarters: Vancouver, Canada 


Founded: July 1999 


Employees: 11 


most successful information visualization application” — was the 
right kind of approach for managing moderate amounts of local 
information, but that it was not suitable to either the volume or dis- 


Key metric: First wave of deployments persed nature of data in a networked environment. Based on this 
includes IBM, Intel, and the US intuition, he began “metaphor shopping” and hit upon cartography, 
National Library of Medicine the art of making maps. 


Funding: Approximately US$4.5 million 


URL: www.antarctica.net 


from Canadian VCs 


Antarctica (SEE FIGURE 4; MAPS.MAP.NET) takes structured data and 


creates a map that groups large blocks of information based on 
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shared characteristics. It then subdivides those blocks into subsets in 
much the same way that the US is divided into states, which are then divided into 
counties. Bray notes that the two principal advantages of using maps as a metaphor 
for organizing large data sets are the hundreds of years of cartographic practice at 
compressing an astonishing amount of information into a limited area, and the 
familiarity users have with the conventions of map reading, such as size (Texas is 
bigger than Rhode Island), 2D positioning (Iowa is above Missouri), nested organi- 
zation (Queens is a county in New York), and a mix of different levels of specificity 
(Ohio and Cleveland are both units on a map). 
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In an interface choice reminiscent of the heat Set Landmark Teleport Export Help 


map, Antarctica’s maps use surface area to indi- 
cate the relative size of document collections. On 
ee a lE "e Script Archiv 
PHP: Hypertext Preprocessor 


Map.net, for example, there is a general-purpose Ses peparenet of Agriculture (USDA) Homepage 


browsing tool for Websites called Visual Net. It ` Nt "hae ae 


u Census parean 


shows a large volume of media and entertain- Untsa Bags pegohpent of ne 


R E { iirenion = ol "o United Nations 
ment sites and a much smaller volume of sites ——_ °° 3 
n ¿ Reference SH Kids and Teens 

related to health. Unlike heat maps, however, the ÉE, «fers na, Renny rie 
Antarctica interface also provides such map-like 5 Min a etre ubrary 

. . . . —_— ai 
features as principal sites of interest, marked as 

` ai 


“cities” (MTV is a city inside the Entertainment 


1-10 >| of2744646 sites 


region) and allows users to mark the map with — . - 
Figure 4: Antarctica’s Visual Net interface to the Open Directory 


* « » 
their own “landmarks. search engine. Major categories of links, such as Business or Sports, 
are represented as color-coded regions, and significant sites are 
shown as “cities” within those regions. 


Like the Plumb Design thesaurus, the Visual Net 
Website browser is an eye-catching demo, but the 
real action is elsewhere. Visual Net is meant to offer a compelling interface built on 
top of the structure derived from collections of information, whether provided 
directly by a database or taken from a harvester or other software application. 
(Antarctica has an alliance with the knowledge-management firm Semio, whose 
flagship Tagger product does this sort of extraction of structure.) The Visual Net 
interface then represents this structure in a way that takes advantage of the powerful 
set of expectations provided by real maps, in the same way the original GUI used the 
expectations provided by real desktops. 


Visual Net is good for data sets with a high degree of nested organization, where the 
user needs to see the forest but his goal is to find a particular tree within it. It is less 
well-suited for data sets that are hard to represent in purely spatial terms, such as 
market data, since the issue there has more to do with envisioning overall patterns 
and quantification rather than finding individual units. The ideal use is document- 
database browsing, such as helping the user look through categories to find individ- 
ual sites or through PubMed medical data to find individual articles. Bray says that 
Antarctica’s testing shows that the more users care about getting at the underlying 
data, the more they like the Antarctica interface. 


Like many information visualization firms, Antarctica has a life-sciences client, the 
National Library of Medicine. But the company is not focusing on particular verti- 
cals. Instead, the current strategic push is to pursue partnerships. Bray says, “Our 
product is really horizontal, and very synergistic with a lot of existing tools for man- 
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aging large amounts of data. When we work with good solution providers, we suc- 
ceed, and when we don't, it’s a lot tougher.” 


Map meets widget: Kartoo 

Laurent Baleydier, founder of the French firm Kartoo, believes that the problems in 
the design of search tools stem from too much left-brain thinking. “Data processing 
specialists are especially left-brain-oriented, and search software typically has strong 
mathematical logic, but is not too imaginative,” with “those interminable lists” as 
one of the worst problems. He thought search could be dramatically improved by 
bringing visual browsing into the search experience, letting users see the results 
grouped by theme, and then helping them heighten or suppress aspects of their 
search, as they hone in on what they want. These two goals — thematic grouping of 
related sites, and simple visual sharpening of the search — led to the interface of the 
Kartoo tool (SEE FIGURE 5; WWW.KARTOO.COM/EN/). 


The Kartoo interface presents the results of a search as a set of connected nodes, 

showing sites as balls that Baleydier likens to cities, and drawing lines between them, 

analogous to roads. (Though Kartoo calls this a cartographic interface, in the termi- 
nology used in this issue it is 
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Figure 5: The Kartoo version of a search for “Jaguar.” The size of the icons indicates rele- q 8 exp 


vance. Note the signs over the Apple site; clicking the minus sign would re-run the search how to build a complex query. 
excluding sites matching Apple’s Jaguar operating system release, while the plus sign Furthermore, by showing key 
would focus the search on the Apple version of Jaguar. concepts that link sites together, 
the interface lets the user skew 
the search to include or exclude themes as well. Once the map of the conceptual 


space of interest has been built by the user, it can be saved for later use; when that 


18 RELEASE 1.0 WWW.EDVENTURE.COM 


user returns, the query will be re-run and the presentation will 
include any additions to the data. (Rather than maintaining its own 
search database, Kartoo operates as a meta-search engine, running 
the user queries on external search engines and then importing the 
results.) 


After working out the original concept in 1998, Baleydier launched 
the first version of the interface in April of 2001. The company was 
founded in October of that year, and the English-language version 
launched in January of 2002. (Kartoo has also launched sites for 
French and Portuguese speakers, and is currently working on 
Spanish, Italian, and German.) It reports around half a million 
unique visitors per month for the French version, and nearly the 
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Key metric: Clients include Michelin, 
BNP Paribas, and three French gov- 
ernment ministries 

Funding: $120, 000 from founders 
Laurent & Nicolas Baleydier 


URL: www.kartoo.com 


same number for the English language version within two months of its launch. 


Because of its user-friendly presentation, so many children use Kartoo that the com- 


pany recently added a “Parental Filter” option as well. 


As with so many firms offering information-visualization tools, the Kartoo search 


interface gets publicity because of the “cool factor,” but much of the business oppor- 


tunity lies elsewhere. It works closely with its clients; According to Nicolas Baleydier, 


Laurent’s cousin and the company’s general manager, “Forty percent [of the value] 


comes from our heads, and 60 percent from our clients. They have the interesting 


problems.” 


Because the original conception of the interface had to do with making access to 


information more intuitive, the idea is applicable to a broader range of problems 


than the Web; the company is working on extending the interface to include a data- 


base search tool, which will let users build database queries in the same way they 


build Web queries today, as well as an intranet search, which can operate as a single 


solution or can be added as an interface to tools such as Verity or Autonomy. Kartoo 


is seeking partners who would license private-label versions of the Kartoo interface 


as part of a larger offering. 


Because search is such a deep and broad problem, Baleydier intends to continue his 


quest to make searching for data anywhere a more imaginative and right-brained 


experience: “Kartoo will continue to improve the relevancy and power of search, and 


on the animated presentation of the results. One should be able to consult the results 


of a search as one looks at a TV report, without effort.” 
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THE GOVERNMENT AND INFORMATION VISUALIZATION 


The US government is back in the technology game. 
Despite the seminal role DARPA and the National Science 
Foundation played in the creation and stewardship of the 
Internet in the first two decades of its life, by the 90s the 
government was viewed as unredeemably stupid and slow; 
the disaster upgrade of the air traffic control system by 
the FAA replaced the launch of the Internet as the illustra- 
tive example of what happens when government gets 
involved in technology. 

All of that changed after September 11, because of 
the enormity of the attacks themselves and the fact that 
they accelerated changes that were already happening. At 
the time of the attacks, the US was in tech-led recession, 
and having the government as a client was beginning to be 
viewed as an enormous plus. September 11 accelerated 
that process by helping to restore the Department of 
Defense to its historic role as a sponsor of advanced re- 
search and a test-bed for new technologies. While agen- 
cies such as DARPA are currently sponsoring long-term 
research through their various offices, such as the Infor- 
mation Awareness, Information Processing and Microsys- 
tems Office, other government organizations are 
operating on shorter time cycles, seeking to find and 
exploit technology closer to commercialization. 

One of the most innovative of these organizations 
is In-Q-Tel, a venture capital group sponsored by the CIA 
to invest in the promising technologies of interest to the 
Agency and the intelligence community. (The name comes 
from “intel,” short for intelligence, punctuated by Q, the 
name of James Bond's gadget-designing colleague.) In-Q- 
Tel's ceo, Gilman Louie, formerly chairman at game-maker 
Spectrum Holobyte, sees a great demand for information 
visualization tools on the part of the government. Louie is 
convinced that the most urgent short-term problem is 
that people are using their desktops and other simple 
graphical tools to organize information visually, even 
though the desktop was never designed for those tasks 
and is inadequate for handling the volume of information 


the average knowledge worker is presented with. 

He sees great opportunity for anything that helps 
extract useful information from ever-expanding data sets. 
However, he is skeptical about the market's ability to sup- 
ply those tools in the short term. 

Louie sees two principal issues. The first is the 
“coolness factor.” At its worst, the move from graphics to 
information visualization merely upgrades chartjunk to 
interactive chartjunk, where the visual appeal of the dis- 
play overwhelms mundane usefulness. “The things I've 
seen are still not intuitive, which makes it hard to take 
something out of the labs and turn it into a real product. | 
think information visualization research needs to be bet- 
ter linked with user interface research.” 

The other problem is that the data that most 
workers are crying out to have organized is the unstruc- 
tured email and files they already deal with every day. As 
reports appear about intelligence that existed within vari- 
ous government and law enforcement agencies prior to 
9/11, it is clear that simply capturing larger patterns from 
the data that is already present could be a big improve- 
ment. “It's Office and Lotus Notes; that's the data people 
are drowning in. A tool that presents me with new ways of 
looking at new data is not nearly as useful as a tool that 
presents me with new ways of looking at the data | have to 
deal with every day," says Louie. 

Longer-term, however, he is more optimistic 
about commercialization of information visualization, both 
for government and private clients. In particular, he thinks 
the details of government procurement requirements will 
force the practical issues to the fore, potential contractors 
will have to deal with the these issues to win a contract. 
“This will get solved by the market at some point, because 
this isn't just an academic exercise; this is a real problem. 
People are saying ‘I've got too much data to deal with, 
right now, and | need a way to see it all.’ As long as that 
demand is out there, someone will be rewarded for finding 
a solution.” 
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Tuned abstraction: Spotfire 

The key part of Spotfire’s DecisionSite (SEE FIGURE 6 IN THE COLOR INSERT; WWW.SPOT- 
FIRE.COM/PRODUCTS/MAPPER.ASP), its flagship “analytic application environment,” is the 
optimization of the responsiveness of the interface. It is designed to support what 
Spotfire calls” guided analytics,” i.e. visualization customized in real time, based on 
user behavior. 


Unlike users of searching or browsing interfaces, DecisionSite users typically don’t 
know what they are looking for while they are using the product, so it is optimized 
to support an exploratory relationship between the users and the data. DecisionSite 
is designed to be able to import data from a variety of sources, then offer users sever- 
al different possible representations of that data. Users can pose and alter queries 
and receive instant responses, enabling them to shape their questions accordingly 
and to manipulate the interface to better support their needs as they learn the data 
set. According to Christopher Ahlberg, the creator of Spotfire, DecisionSite lies 
“somewhere between interface and database.” 


Spotfire’s client list is heavily weighted toward businesses that must integrate scien- 
tific data with business logic. The company has over 500 clients, including the top 25 
pharmaceutical firms and four of the top five oil companies. It licenses its various 
tools and customizes them for certain problems (DecisionSite for Functional 
Genomics, DecisionSite for Process Analysis, etc.), attacking the user-education 
problem by making the tools easy to use and packaging them with 


guides, which are essentially domain-specific manuals with best 
SPOTFIRE INFO 
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For example, it has partnered with Agilent, the HP spin-off, to offer Sprout Group, InnovationsKapital, 
DecisionSite as part of a larger knowledge-management solution. Pequot Capital; Quantum Fund, 


Cooper Hill Partners 


URL: www.spotfire.com 


Despite working in Ben Shneiderman’s lab at the University of 


Maryland in the early 90s, Ahlberg is not optimistic about the short- 

term applicability of the information visualization work being done in most acade- 
mic labs today. He sees the labs focusing on esoteric work, when his business 
experiences have taught him that the user wants a simple and effective interface; the 
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core user demand for these tools is to “Keep it fast.” As it continues developing its 


tools, Spotfire is focusing on applying that lesson. 


Inxight: The xent of information 


In many cases, the core problem in visualizing a set of data is not the number of 


points of data to be represented, but the number of interconnections. Hierarchies 
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present a particular problem; They can even be worse than webs, 
because webs double back on themselves while hierarchies always 
involve all-new links in every generation. If you are charting the 
interrelationship among documents, and each one references half a 
dozen others, a chart that goes three levels deep involves only a few 
hundred documents, but one that goes six levels deep involves tens 
of thousands of interconnected documents. This problem appears 
in any collection of information where there are several layers of 
dependencies, from large organizational hierarchies to complex 
tasks such as assembling a car. In the late 80s at Xerox PARC, Stuart 
Card, Jock Mackinlay, and George Robertson designed a number of 
3D interfaces to attack this problem (see sipeBar, PAGE 15). In the 
early 90s, Ramana Rao began looking for solutions that would avoid 


the problems associated with 3D, particularly the limited hardware of the average 


user. During this research, he co-invented the hyperbolic browser, later renamed the 


Star Tree, with John Lamping. 


Figure 7: Circle Limit IV, by M.C. Escher. As 
the interlocking angels and devils approach 
the edge, they get infinitely smaller, thereby 


making up the (theoretical) border. 
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The inspiration for the hyperbolic tree map was a set of circular 
prints by M.C. Escher (see Figure 7), which involved interlocking 
shapes that grow smaller as they get closer to the edge of the circle, 
thereby creating the illusion that the edge itself was made up of an 
infinite number of infinitely small shapes. 


The Star Tree (SEE FIGURE 8; WWW.INXIGHT.COM/PRODUCTS/VIZSERVER/) 
works the same way, providing a “fish-eye” view of data. Some part 
of a hierarchy is the center of the circle, with the linked data 
expanding outwards and the scale shrinking exponentially towards 
the edges, so that the bulk of the information is represented as clus- 
ters of links at the edge of the circle. This allows the tree to repre- 
sent a very large number of points in finite space, without needing 
to know in advance exactly how many points are involved. 
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The concept was given an interactive aspect in 1993, when Rao and Lamping devel- 
oped a browser that could take large hierarchies of data and render them hyperboli- 
cally in real time. Clicking on a node would move it 


to the center, where it would expand, while the rest VL 
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duces what he calls “the scent of information,” 


where the interface shows you not only where you 

are, but gives you hints about where you can go next. Studies at PARC suggest that 
the combined specific and general view of a hierarchy can allow the user to locate 
items of interest considerably faster than with traditional search techniques. 


In 1996, Inxight was spun off from PARC, in order to capitalize on the hyperbolic 
tree and a number of other PARC inventions in document management. Coming 
from this document-management tradition, Inxight links search and knowledge 
management tools with its hyperbolic browser. Inxight has licensed the interface on 
an OEM or partnership basis to a number of solution providers, including 
Computer Associates, HP, SAS and Tivoli. Rao, now the cto and svp at Inxight, says 
the company’s future information visualization work will continue to “design 
around the skills we had before we are born. How much information can you be act- 
ing on at any given moment? How can we take advantage of evolved spatial and per- 
ceptual skills like peripheral vision, so that while you're working, you're aware of the 
space of possible next places?” 


The patterns in the words: Semio 

Garbage in, garbage out: Information visualization is only as good as its input. Much 
corporate data is not amenable to visualization because it exists in Word and 
PowerPoint files, rather than databases or other structured information storage. All 
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INFORMATION VISUALIZATION RESEARCH 


Though information visualization techniques are only now 
seeing widespread commercialization, as a research disci- 
pline the field is roughly 15 years old. The various threads 
of interface design, human-computer interface, and infor- 
mation density have existed for decades, but the field 
really coalesced with a push from the US government, in 
the form of the 1987 National Science Foundation report 
“Visualization in Scientific Computing.” 

Since then, the bulk of the work in information 
visualization has gone on in academic and commercial 
labs, with only a small percentage of the work crossing 
over into commercial applications. Two of these labs stand 
out. The first is Ben Shneiderman's Human-Computer 
Interaction Laboratory at the University of Maryland. 
Shneiderman's work on tree maps later became heat 
maps, the concept behind Smart Money's Market Map 
(SEE PAGE 7), as well as Panopticon's (SEE PAGE 9) and 
the Hive Group's products. In addition, Chris Ahlberg 
worked in Shneiderman’'s lab in the early 90's before leav- 
ing to found Spotfire (SEE PAGE 21). 

The other is Xerox PARC, which got into informa- 
tion visualization in the mid-80s, when John Seely Brown 
began canvassing the staff about areas where PARC 
should make big bets. Because PARC was home to the 
Alto, one of the first computers with a GUI and running on 
one of the first LANs, the PARC staff could already see 
that the desktop was running into limits as a visual tool 
for organizing a user's view of the files stored locally or 
on the network. Under Stuart Card, they began looking at 
ways to present alternate views of document space, lead- 
ing to the hyperbolic tree map, later to be rechristened 
the star map and licensed by Inxight (SEE PAGE 22). 

Alan Kay once said, “The best way to predict the 
future is to invent it.” Shneiderman and Card have tried 
an alternate approach: “The best way to predict the 
future is to describe it." Their Readings in Information 
Visualization, published in 1999 and co-authored with 
Jock Mackinlay, is one of the key textbooks in the field. 

Many information visualization techniques used 
by companies come out of applied research that treat 
information visualization as a means to an end. Lumeta, 
which was spun off from Lucent in 2000, uses these tech- 


niques as a way of representing existing network data. 
Likewise, the enormous growth in the number of social 
network analysis visualization techniques from labs such 
as Microsoft's Social Computing Group, IBM's Social 
Computing Group, and MIT's Social Media Group are opti- 
mizing information visualization to solve particular prob- 
lems in understanding social organization. 

Another strong area of applied research is in 
interpreting data about the Internet. Martin Dodge, who 
runs the Center for Advanced Spatial Analysis (CASA) at 
University College of London, maintains Cybergeogra- 
phy.org, a site dedicated to listing and pointing to the 
wide variety of representations of both technological and 
social views of the Internet. Likewise, the Cooperative 
Assoc-iation for Internet Data Analysis (CAIDA) runs a 
site with a number of visualization tools dedicated to the 
“engineering and maintenance of a robust, scalable global 
Internet infrastructure.” 

Even the art world is a fertile source of experi- 
mentation. The artist Mark Lombardi is famous for his 
hand-drawn network analysis diagrams of the relations 
between business and politics. (Most provocative title so 
far: “george w. bush, harken energy, and jackson stevens c. 
1979-90.") The art collective Rhizome.org features a vari- 
ety of alternative interfaces for browsing its collection, 
and collects artistic experiments in information visualiza- 
tion.To tie all the threads together, Martin Wattenberg, one 
of the artists whose work is shown on Rhizome, also 
invented the Smart Money Market Map, which was in turn 
derived from Shneiderman's work at HCIL. 

Despite the variety of techniques resulting from 
all this research, the fact that most of the information 
visualization in the commercial world falls into a few cate- 
gories (SEE SIDEBAR, PAGE 15) suggests that there is no 
clear or obvious path from the lab to the field. HCIL has 
had as much or more success as commercial research 
groups in getting its tools and techniques adopted com- 
mercially. The 15-year history of the field suggests that 
despite the R&D title, lab work in information is almost 
pure “R,” even in commercial labs, and that the “D” has 
been outsourced to firm s determined to make the tools 
user-friendly. 


24 RELEASE 1.0 


WWW.EDVENTURE.COM 


information visualization tools rely on some source of ordered data, mane 


often being designed to interoperate with existing knowledge man- 
Headquarters: Reston, VA 


agement software. In Semio’s case, it had already created the knowl- 
Founded: 1996 


edge-management software, in the form of its Tagger product, S 


which takes the unstructured documents that make up the bulk of Key metrič Recentiy acguired by 


most organizations’ digital repositories and creates a set of metadata Webversa 
in XML format that describes the topics and structure of those doc- Funding: $22 million from VCs in the US 
uments and their relationship to one another. (Like most of the URL: www.semio.com 


businesses in this issue, Semio also interoperates with other soft- 
ware. The Tagger produces standardized XML output that can be by 
Antarctica’s Visual Net, and Semio Map can read and interpret the structured output 
from, inter alia, Plumtree Portal and Verity.) 


Simply deriving implicit structure from a group of unstructured documents does not 
guarantee that anyone will actually be able to glean useful insights from that struc- 
ture. This problem grows larger as the amount of indexed information grows. Semio 
turned to information visualization, with its Semio Map product, as a way of helping 
the user view and interpret the implicit structure extracted by Tagger. 


(As this issue was being written, Semio was acquired by Webversa, a firm that pro- 
vides real-time alerting and data access from phones and mobile devices.) 


Visualizing networks: Lumeta 

Lumeta’s famous map of the Internet (SEE FIGURE 9 IN THE COLOR INSERT; 
WWW.LUMETA.COM/GALLERY/) is a beautiful and startling image — a huge cloud of colored 
lines that show where networks connect to one another, with the heavyweight net- 
works like AT&T and Uunet in the center and countless smaller networks arrayed 
around the edges. In the words of Bill Cheswick, one of its creators, “It looks like a 
peacock hit a windshield.” 


The project started in 1996, after a brainstorming exercise at the Highlands Forum 
(a periodic conference on technology issues hosted for various organizations within 
the Department of Defense) about the possibility of network attacks, which led 
Cheswick to realize that understanding where the Internet interconnected might be 
useful in a disaster. He then began working with Hal Burch, a PhD candidate at 
Carnegie Mellon, on a program that would scan the network and return data about 
interconnections. Presented with a set of tens of thousands of points, they set about 
designing a simple, meaningful way to show the results. They ignored most of the 
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current research in information visualization, preferring instead to use “brute force 
and pig ignorance,” as Cheswick puts it, to build their own version of the ’weighted 
spring” interface. Moore’s Law was on their side: Ordinary workstations delivered 
enough raw processing power to allow tens of thousands of points to be crunched 
and drawn, even as the published literature of the time suggested that handling a few 
hundred such points was hard. 


Lumeta was spun off from Lucent to commercialize the mapping work and a related 
algorithm that would locate network vulnerabilities. (We first covered Lumeta about 
6 months after it was founded (sEE RELEASE 1.0, MARCH 2001).) The idea was that while 
every company might have different business problems, they all had similar IT prob- 
lems; moreover, the larger the firm, the more intractable the problems. One surpris- 
ing and persistent corporate IT challenge is simply to understand what is connected 
to anetwork. This comes about in part because it’s so easy for employees and depart- 
ments to add new equipment to the edges of an existing network, 
and in part because both mergers & acquisitions and divestitures are 


LUMETA INFO 


typically finished on paper long before the related networks are fully 


Headquarters: Somerset, NJ divided or combined. 


Founded: Fall 2000 
Employees: 29 


Funding: $13 million from Draper Fisher 
Jurvetson, New Venture Partners, 


Wachovia Strategic Ventures Group 


URL: www.lumeta.com 


When working with clients, Lumeta uses a simple effect to drama- 
tize the value of the information it collects: On the map it creates of 
the client’s network, it shows the parts of the network that the com- 
pany identified in advance in blue, and uses red to detail the parts 
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that no one in management knew (or told) about. (Cheswick relates 
how he ran early versions of the map at Lucent and found machines 
from other companies directly attached to the Lucent network, a legacy of incom- 
plete divestitures.) This dramatic representation of “here’s what you didn’t know” 
helps focus management and IT staff on both the size and location of potential 
problems. 


One key function of Lumeta’s software is to help improve network security by iden- 
tifying vulnerabilities. However, Cheswick has found that network security is still 
only a middling priority for IT departments. He notes, “The security guys want our 
software, but the network guys pay for it.” Now, with heightened awareness of all 
sorts of security concerns post-9/11, coupled with the general growth in network 
break-ins and related crimes such as identity theft, the position of those members of 
the IT department charged with network security may be improving. Tom Dent, 
Lumeta’s ceo, says that the aftermath of the attack has also raised Lumeta’s profile, 
especially with US government agencies: “Our experience has been that after 9/11, 
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government agencies were both more aware of their network security issues, and 
more aware that we were offering something that could help.” 


In addition to security, a key advantage of visualizing a network is to set an initial 
baseline and watch it change over time, in order to make sure that the network is 
developing in ways you intend and not being extended or breached in ways you 
don’t know about. While still at Lucent, Cheswick and Burch produced a movie of 
the Lucent network changing over time, by taking periodic snapshots and using 
them as individual frames in an animation. That kind of lightweight but powerful 
addition, rather than ’wouldn’t it be cool” research into 3D display, is the best way to 
increase the usefulness of visual tools in the near term, says Cheswick. 


Cheswick says information visualization represents only a third of the value Lumeta 
brings to its clients, with the rest coming from the other methods it has for under- 
standing network topologies and from security audits and performance tuning. Its 
biggest clients are financial services firms and government agencies, both of which 
have significant security and auditing needs. However, because every large firm has a 
large network (usually built piecemeal), Lumeta does not have to focus solely on 
these verticals; it counts clients in transport, chemicals, and computer hardware and 
software as well. It is also responding to demand by its clients for even finer corporate 
control over data. The current model is to sell the service on an ASP basis, but corpo- 
rate demand for daily or real-time network scans is leading it to work on a Lumeta 
appliance, which could be deployed directly within a corporate LAN or WAN. 


Visualizing Ourselves: Social Network Analysis 


Shortly after the September 11th attacks, a network map of a different sort was pub- 
lished, one showing the relation of the 19 hijackers to one another. The map was 
derived from published reports of relations between the individuals, and made clear 
what a key figure Mohamed Atta was in bringing together several unconnected clus- 
ters. The “hijacker” map (SEE FIGURE 10; HTTP://WWW.ORGNET.COM/HIJACKERS.HTML) Was 
created by Valdis Krebs, one of the pre-eminent figures in the growing practice of 
social network analysis. It is now the most widely distributed piece of social network 
analysis ever created. 


Social network analysis (SNA) is the kind of problem that information visualization 
was made to handle. As complex as our collections of documents or networks of 
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computers might be, nothing rivals real human networking, where an enormous 
range of human context and different kinds of relationships (“I know her, but I 
don’t really know her”) makes the problem exponentially more complicated. If you 
attempt to chart the interconnections among a group of even a few individuals, you 
will get one drawing if you ask who knows whom, a second if you ask who likes 
whom, and a third if you ask who trusts whom. Moreover, the links can differ by 
direction; Juan may like Alice even though Alice does not like Juan. 


We are able to handle some of this human context, often subconsciously, in keeping 
track of our own social worlds, but there are two strong limitations on our ability to 
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Figure 10: Valdis Krebs’ social map of the September 11th terror attacks 
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understand social networks. The first is scale: The primatologist Roland Dunbar 
argues that the size of primate brains reflects the size of the social groups they need to 
keep track of. He claims that the size of human brains suggests that we are optimized 
for keeping track of an extended social group of 150 or fewer members. (Malcolm 
Gladwell cited Dunbar’s thesis as “The Rule of 150” in The Tipping Point, his influen- 
tial book about the nature of epidemics — social and otherwise.) 


The second limitation is that we are bad at understanding social groups we don’t 
belong to: Anyone who has ever been excluded from a clique or other tight social 
group knows how opaque it seems from the outside. 


Social network analysis would seem to offer a solution to both of these problems. 
However, SNA is not yet at the point where a business can simply buy an SNA soft- 
ware package. The companies offering any sort of SNA are almost exclusively service 
businesses. Krebs’ own firm, Orgnet, is primarily a vehicle for his consulting busi- 
ness, and his biggest client, IBM, eventually licensed his tools to use them in its own 
consulting arm, not to re-sell them. Knetmap.com (pronounced “netmap”) uses a 
simplified version of Krebs’ software as part of a consulting package for its corporate 
clients. It is often coupled with its “Question of the Week” service, in which Knetmap 
maps who knows what in a company by emailing the employees one survey question 
a week, over a period of several months. Most of the SNA work going on is still in 
research labs, from Microsoft’s work on mapping Usenet conversations, to Warren 


>.» 


Sack’s ”Large Conversation Map” at Berkeley. 


Conclusion: Product or Service? 


Social network analysis provides an interesting test case for the place of information 
visualization in the business world. Does it exist mainly as a service because it is so 
context-sensitive that the techniques can’t be packaged, or is it a matter of time 
before the techniques and end-user awareness make sales of SNA products practical? 


Over the last 15 years, information visualization has gone from academic exercise, to 
an application with high “coolness factor,” to a handful of useful tools optimized for 
specific jobs. The Market Map illustrates many of the themes common to successful 
information visualization. First, it is specifically crafted to do what it does well for a 
narrow problem, using a deceptively simple interface. 
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Next, the Market Map operates in a well-known space, uses a handful of key numeri- 
cal variables, and has carefully mapped the right visuals to the right variables. 
Though market cap and performance are both dynamic, performance changes much 
faster and operates in a narrower dynamic range. Companies differ in size by orders 
of magnitude, but a company that outperforms the market (or its sector), even by a 
few percentage points, is a standout. By mapping the more dynamic but compressed 
data to color, which we can compare more easily than surface area, the Market Map 
communicates the main issue quickly, while allowing users to compare market cap 
as a second order issue. 


Color also maps well to performance because performance can be positive or nega- 
tive, while market capitalization can only be positive. The use of color indicates posi- 
tive or negative performance at a glance — red or green — while that kind of 
refinement would be wasted on the simple “bigger or smaller” comparison of com- 
pany size. The Market Map also takes advantage of symbolic color cues: Red is for 
blood, red is for stop signs, red ink is a loss; green is for growing plants, green light 
means go, and so on. (Imagine how hard the Market Map would be to read success- 
fully if surface area were daily performance, while color was market capitalization.) 


The overall lesson is that a general-purpose information-visualization tool does not 
exist, and that significant attention is required to match a problem space with a par- 
ticular representation. All the businesses offering information visualization tools 
have embraced this lesson in one way or another, usually by offering customization 
as a service, focusing on specific vertical industry segments, or both. 


What happens next is the billion-dollar question. Will information visualization 
become familiar enough to be offered as a pure product, or will it always be part of a 
“sales and service” package that includes heavy customization, training of end-users, 
and consulting services for interpreting the results? The answer, of course, will deter- 
mine how the business prospects for information visualization unfold. A pure soft- 
ware business can scale very quickly in both size and global distribution, and if the 
audience grows large enough, the profit margin on each installation can be very 
high. A sales and service business, by contrast, can only scale as quickly as the sales 
and support staff can be hired and trained, and because each new region or vertical 
requires new staff, margins are often capped. 


Though at the moment most companies are focusing on the sales and service model, 


two forces are pushing information visualization in the direction of pure software, 
with an optional service component. The first is simply the huge demand for new 
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ways to deal with information overload. People are drowning in data (with more on 
the way!), while the standard tools people have for sorting and managing that infor- 
mation — their email in-boxes and the desktop, mainly — are showing signs of strain. 
This gap between information overload and the current generation of graphical 
tools represents an opportunity for information visualization, as it is an incentive for 
potential clients to educate themselves about alternative tools. 


The other force at work is familiarity. Because current information visualization 
interfaces are clustered around a few techniques, these techniques may eventually 
become familiar enough to seem intuitive. The enormous conservatism of GUI 
development from the original Xerox PARC Alto to Windows XP and Apple’s OS X 
suggests that visual tools develop in a fashion Stephen Gould (RIP) called punctuat- 
ed equilibrium: Novel features, once successfully introduced, are adopted widely but 
improved upon only slowly and conservatively. In such a system, 
occasional bursts of “new” alternate with long cycles of “somewhat 
improved.” If this pattern holds true for information visualization, 
we are in one of those long cycles of improvement, where tech- 


COMING SOON 


niques developed in the lab years ago have finally gotten good 
enough to meet the needs of the average user. The earliest market 
penetration is coming in data-intensive verticals, but as the tech- 
niques improve, the range of possible commercial uses is enormous. 


Although there is always the possibility that some incredible tech- 
nique will come out of an academic environment and turn things 
upside down in a short period of time (c.f. the Web; Google), there 
is general skepticism — even among people currently engaged in 


e Grid Computing 
e Digital Identity for objects. 
e Human Capital 


e And much more. .. (If you 
know of any good examples of 
the categories listed above, 


please let us know.) 


commercializing research work — that any such revolution is likely. 
Instead, the likeliest advances in information visualization will 
come from further development of tools for extracting the data to be visualized, and 
from refinement of existing visualization techniques. 


The fact that the technology is unlikely to undergo dramatic changes any time soon, 
however, does not mean that the business won't. Indeed, information visualization 
seems to have reached that tipping point first identified by Alan Kay: “Good enough 
to criticize.” Near-term improvements will likely be a result of dialogue between tool 
developers and their clients. Though this kind of dialogue is no guarantee of success, 
it is a necessary condition for solid technology to go from “useful in a few verticals” 
to “useful everywhere there is too much data.” 
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One key issue, as always, is user friendliness. The tools have tended to come from 
pie-in-the-sky research environments, but real users face tradeoffs that don’t favor 
the experimental. In this environment, offering long-term improvements in ROI 
isn’t enough; the learning curve has to be shallow enough to offer the user partial 
results within a few hours (or minutes!) of getting the tool. Furthermore, the focus 
of tool developers on heavily indexed data ignores the information overload prob- 
lems most users face, which has much more to do with the stuff on their desks than 
the stuff in their databases. 


The industry is still young, however; even Spotfire, one of the oldest of the firms 
profiled here, was founded only in 1996. Furthermore, time is on its side, because the 
demand for better tools and the supply of better techniques are both growing. Of 
our five senses, vision is the one best optimized for taking in and processing huge 
volumes of information quickly, and someone is going to figure out how to exploit 
that capability in novel ways. As Ramana Rao put it: “Everyone thinks revolutionar- 
ies are impatient, but we are actually extremely patient. We are so stubborn we will 
keep at it ‘til it happens.”  ggri.o 
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Resources & Contact Information 


Martin Wattenberg, mwatten@us.ibm.com 

Tim Bray (c/o Stephanie Lummis), Antarctica Systems, 1 (604) 873-6100 ext. 110; fax, 1 (604) 873-6188; lum- 
mis@antarctica.net 

Ben Shneiderman, HCIL/University of Maryland, ben@cs.umd.edu; http://www.cs.umd.edu/~ben 

Gilman Louie, In=-Q=Tel, 1 (703) 248-3020; fax, 1 (703) 248-3039; glouie@in-q-tel.org 

Ramana Rao, Inxight, 1 (408) 738-6410; fax, 1 (408) 738-6352; rao@inxight.com 

Alexandre dos Santos, Kartoo; alex@kartoo.com 

Bill Cheswick, Lumeta, 1 (732) 357-3500; fax, (732) 564-0731, ches@lumeta.com 

Tom Dent, Lumeta, 1 (732) 357-3500; fax, (732) 564-0731; tdent@lumeta.com 

Willem De Geer, Panopticon, 46 (8) 50 68 79 850; willem@panopticon.com 

Michael Freedman, Plumb Design, 1 (212) 285-8600 ext. 223; fax, 1 (212) 285-8999; freedman@plumbdesign.com 

Chris Engle, Plumb Design, 1 (212) 285-8600 ext. 227; fax, 1 (212) 285-8999; cengle@plumbdesign.com 

Christopher Ahlberg, Spotfire, 1 (617) 702-1550; fax, 1 (617) 702-1700, ahlberg@spotfire.com 

Tom Lewis, Webversa (Semio), 1 (703) 207-0400; tlewis@webversa.com 


Information Visualization Tools & Demos: 

Antarctica — http://maps.map.net 

Inxight — http://www.inxight.com/products/vizserver/ 
Kartoo — http://www.kartoo.com/en/ 

Lumeta — http://www.lumeta.com/gallery/ 

Panopticon — http://www.panopticon.com 

Plumb Design — http://www.visualthesaurus.com 

Semio — http://www.semio.com/products/semiomap.asp 
Smart Money — http://www.smartmoney.com/marketmap/ 


Spotfire — http://www.spotfire.com/products/mapper.asp 


For further reading: 

University of Maryland Human Computer Interface Lab — http://www.cs.umd.edu/hcil 

Readings in Information Visualization, by Stuart K. Card, Jock D. Mackinlay, and Ben Shneiderman; 
http://www.cs.umd.edu/hcil/pubs/books/readings-info -vis.shtml 

Visual Explanations, Envisioning Information, and The Visual Display of Quantitative Information, by Edward Tufte; 
http://www.edwardtufte.com 


Microsoft's Polyarchy Visualization reasearch — http://research.microsoft.com/users/marycz/chi2002poly.pdf 
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Calendar of High-Tech Events 


2002 


SEPTEMBER 24-26 Privacy 2002 — Cleveland, OH. This year's theme is Information, Security 
and New Global Realities as participants try to bridge the gap between the 
needs of business and government, and the concerns of consumers and priva- 
cy advocates. Register online, or contact Sol Bermann 1 (614) 688-4578; 
bermann@osc.edu; www.privacy2000.org. & 


SEPTEMBER 28-30 Telecommunications Policy Roundtable Conference — Alexandria, VA. The 
30th annual gathering of telecom and convergence policy wonks. For informa- 
tion call Latricia Allen-Thomas at 1 (301) 565-3371; info@tprc.org; 
www.tprc.org/TPRC02/2002.htm. 


SEPTEMBER 30-OCTOBER 2 Telecosm 2002 — Lake Tahoe, CA. George Gilder's annual event covering the 
future of convergence. For registration information, call Amanda Feszchur at 1 
(212) 367-3521; fax, 1 (212) 367-3514; afeszchur@forbes.com; 
www.forbes.com/conf/telecosm/. 


SEPTEMBER 30-OCTOBER3 O'Reilly Mac OS X conference — Santa Clara, CA. Explores how Apple’s 
rebuilt operating system is creating fertile ground for Mac Users. Register 
online; http://conferences.oreilly.com. 


OCTOBER 1-3 Streaming Media East 2002 — New York, NY. Co-located with Internet 
World Fall and touted as the largest Internet audio and video event. For more 
information, go to www.streamingmedia.com/east/ or call 1 (800) 632-5537 
(U.S.) or 1 (203) 559-2800. 


OCTOBER 1-3 Fall Internet World — New York, NY. The mother of all Internet trade shows. 
To register, call 1 (800) 632-5537, fax, 1 (203) 559-2814; 
www.internetworld.com/events/fall2002/. 


OCTOBER 7-8 Beyond the Numbers...The Mature Market Summit — New York, NY. 
J. Walter Thompson's second conference focusing new marketing 
strategies for targeting the 40+ market. Contact Nan McCann, 

1 (860) 724-2649 x13, nan.pme-events@snet.net; 
ww.frnht.com/clients_aquent/JWTSC/mmg_summit/. 


OCTOBER 7-10 Fall 2002 VON - Atlanta, GA. It's not just about voice, despite the name. The 
IP communications industry gets together to talk tech and do business. For 
more information, call 1 (631) 547-0800 or email von2002@pulver.com. 
www.pulver.com/von/. 


OCTOBER 9-11 Digital ID World Conference 2002 — Denver, CO. "Identity Crisis: Taming 
the Network" is the theme of the first major event designed to drive the emerg- 
ing digital identity industry. Register online or email sales@digitalworld.com 
for more information; www.digitalidworld.com/conference/2002. E 
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Calendar of High-Tech Events 


OCTOBER 13-15 


OCTOBER 14-17 


OCTOBER 16-18 


OCTOBER 18-20 


OCTOBER 20-23 


OCTOBER 22-23 


OCTOBER 30-31 


NOVEMBER 3-5 


Agenda 2003 - Phoenix, AZ. Lead by Jim Fallows, the 16th gathering of IT 
leaders meets to assess the industry's power in the world economy. and to dis- 
cuss what's on the agenda for 2003. Log on to request an invitation to the invi- 
tation-only event. For more information, call 1 (800) 633-4312; outside the 
US, +1 (650) 577-7801; www.idgexecforums.com/agenda. 


Microprocessor Forum 2002 - San Jose, CA. For 15 years, this has been the 
place to learn about groundbreaking disclosures and hear insightful analysis 
about significant microprocessor developments. Call 1 (480) 483-4441; fax, 1 
(480) 483-0400; www.mdronline.com/mpf. 


Privacy & Data Security Academy & Expo — Chicago, IL. The International 
Association of Privacy Officers invites you to come learn how to prepare your- 
self for the future of privacy. For more info, call 1 (800) 266-6501; fax, 1 (215) 
545-8107; info@privacyassociation.com; www.privacyassociation.org. 


Pop!Tech 2002 - Artificial Worlds — Camden, ME. A potent mix of vision- 
aries, technologists, policy makers, and venture capitalists join to focus on the 
intersection of technology and society in the brave new world. To register, call 
1 (800)673-4855 or register online; www.poptech.org. 


Direct Marketing Association Conference — San Francisco, CA. It's not 
about spam, it's about reaching customers — trust us! To register, call 1 (877) 
517-3040 (US only) or +1 (404) 584-7458; email thedma@ambassadors.com; 
www.the-dma.org/dmaannual/index.shtml. 


Jupiter Advertising Forum — New York, NY. Marketers and publishers gath- 
er to discuss innovations and trends in online advertising. To register, email 
registration@internet.com or call 1(203) 662-2857; 
www.intmediaevents.com/jaf/fall02/. 


East-Central Europe Conference — New York, NY. Partnering with East- 
Central Europe - Business and government leaders focus on IT and telecom 
infrastructure modernization and the roles played by international financial 
institutions, multinational corporations and national governments. Register 
online, or contact Gordon Feller, 1 (415) 491-4233; GordonF20@ATTbi.com; 
www.acteva.com/go/NYconference. 


EDventure's High-Tech Forum — Berlin, Germany. In its 12th year, 
Edventure’s High-Tech Forum is the premier event for technology entrepre- 
neurs, investors and vendors doing business in Europe. The theme this year is 
"The digital European: partner, customer, employee." Join us in Berlin! For 
registration details, visit www.edventure.com/htforum/ or contact Daphne Kis 
at daphne@edventure.com; 1 (212) 924-8800; fax, 1 (212) 924-0240. ØB 


Ø Events Esther plans to attend. 
[4 Events Kevin plans to attend. 


Lack of a symbol is no indication of lack of merit. The full, current calendar is available on our Website, www.edventure.com. 
Please contact Christina Koukkos (christina@edventure.com) to let us know about other events we should include. 
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See and be seen! Hear and be heard! 
EDventure’s High-Tech Forum is designed to encourage both attendees and speakers to share wisdom and 
experience. Join executives from global and European companies like Cisco, SAP, Tesco, BT, Atlas 
Ventures, Visa, Benetton and StepStone at this high-level workshop-style event. Do you have a product 
that needs broader reach? Do you want to extend your business to a new country? We'll introduce you to 
marketing partners from thoughout Europe. Are you an entrepreneur in search of funding? We'll help you 


meet VCs. You can find both advice and partners at the Forum. Take advantage of the early registration 


fee and register today at: http://www.edventure.com/htforum/. 


EDventure's High-Tech Forum 
November 3 to 5, 2002 


Berlin, Germany 


Release 1.0 Subscription Form 
Subscribe and join the other industry executives who rely on Release 1.0. You can also subscribe online at www.release1-O.com. 
Your annual Release 1.0 subscription costs $795 per year ($850 outside the US, Canada and Mexico), and includes both the print 


and electronic versions of 11 monthly issues; 25% off the cover price when you order from our online archives; a Release 1.0 


binder; the bound transcript of this year's PC Forum (a $300 value) and an invitation to next year’s PC Forum. 


NAME 


TITLE COMPANY 


ADDRESS 


CITY STATE ZIP COUNTRY 


TELEPHONE FAX 


EMAIL* URL 
*personal email address required for electronic access. 


My colleagues should read Release 1.0, too! 
Send me information about multiple copy subscriptions and electronic site licenses. 


Check enclosed Charge my (circle one): AMERICAN EXPRESS MASTERCARD VISA 


CARD NUMBER EXPIRATION DATE 


NAME AND BILLING ADDRESS 


SIGNATURE 


Please fax this form to Natasha Felshman at 1 (212) 924-0240. 


Payment must be included with this form. Your satisfaction is guaranteed or your money back. 


If you wish to pay by check, please mail this form with payment to: EDventure Holdings, 104 Fifth Avenue, 20th Floor, New York, 


NY 10011, USA. If you have any questions, please call us at 1 (212) 924-8800; email us@edventure.com; www.edventure.com. 
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By their nature, information visualization tools are better shown (in color!) than explained; in fact, 


some require that they be shown in color. Refer to the indicated pages to learn about the companies 
that developed the tools pictured below. 


Headline Ico Rape Nasdaq 1330.67 pow eee © © © Legend 


Map Control Panel 
Color key (% change) 


“3% 40% 43% 46% 


Health Care 


Capital Goods _ 


© No highlights 


Find (name or ticker) 


—_—Gonsumer-Gyclicals ——— Consumer Staples 


Figure 1: The Smart Money Market Map (PAGE 7 ). The colored rectangles are individual companies, the gray internal 
borders delineate sectors such as Energy or Transport, and the color of each rectangle indicates relative performance. 


CREDIT FIRST 
SUISSE | BOSTON 


View 
World Indices 


United States ` Western Europe 


Color 
Change iclose} 
Highlight 


wind Latin America 


Find 


Slovakia 


Slovenia |Hungary 
Show Instructions 


South Africa 


Figure 2: Panopticon's (PAGE 9) geographically organized Market Map for CSFB, showing relative performance 
of major world markets grouped by region. 
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Figure 6: Spotfire's DecisionSite interface designed for oil drilling (PAGE 21). 


Figure 9: "Like a peacock hit a windshield." Lumeta's map of the Internet, with 
different backbone networks color-coded (PAGE 25). 
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